Data models for an integrated thesaurus database

نویسنده

Dagobert Soergel

چکیده

This paper presents two data models for storing multiple thesauri in a single integrated database to be used as an aid to searchers in multi-database searching, for the construction of conversion tables between thesauri, and as a tool for constructing and maintaining individual thesauri. The paper first describes the nature of thesaurus data and a relational data structure for such data, which is flexible and — through its use of term numbers in recording relationships — economical in storage. It then describes two data models for structuring an integrated thesaurus database. In both models, general data on terms and relationships are stored once, with indication of one or more sources, resulting in storage economy. The term-based model stores all relationships as relationships between terms. This is flexible but redundant: If the same concept relationship is expressed through different terms in different thesauri, it is stored multiple times in the integrated database. The concept-based model identifies concepts by concept numbers and uses these concept numbers to record concept relationships, thus bringing together all occurrences of the same concept relationship regardless of the terms used to express the related concepts. This results in more compact storage but is less flexible.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی مقایسه‎ای روابط معنایی، ساختار شکلی و سیستم مدیریت اصطلاحنامه‎های فنی ـ مهندسی و نما

Purpose: Thesauri as important tools in storage and retrieval information systems have a significant role in the optimization of database search. So the publishing of thesauri needs to use standards as much as possible. I examined and compared two important thesauruses on the basis of ANSI/NISO z39.19 2005. Methodology: This study is an analytical and applied survey. The study population was t...

متن کامل

وضعیت بازیابی اطلاعات در دو پایگاه نمایه و نما و سنجش اثربخشی استفاده از واژگان کنترل ‌شده در نمایه‌سازی این دو پایگاه

Purpose: This study was carried out to determine the level of precision, recall, and searching time for “Nama” and “Namayeh” databases, as well as to find out which of the indexing tools (thesaurus and Dewey decimal classification) helps us more in improvement of information retrieval. Methodology: This study is an analytical survey in which the necessary data was collected by direct observati...

متن کامل

Creating and Querying an Integrated Ontology for Molecular and Phenotypic Cereals Data

In this paper we describe the development of an ontology of molecular and phenotypic cereals data, realized by integrating existing public web databases with the database developed by the research group of the CEREALAB project. This integration is obtained using the MOMIS system (Mediator envirOnment for Multiple Information Sources), a mediator based data integration system developed by the Da...

متن کامل

Thesaurus-Based Software Environments

Software environments support the process of constructing and maintaining application systems. This paper describes the idea of a thesaurus1 as a viable foundation for software environments. A thesaurus contains information about the names and identifiers in all the software written in all the languages of an application. Information about extensional data in a database or persistent store is a...

متن کامل

Prediction of global sea cucumber capture production based on the exponential smoothing and ARIMA models

Sea cucumber catch has followed “boom-and-bust” patterns over the period of 60 years from 1950-2010, and sea cucumber fisheries have had important ecological, economic and societal roles. However, sea cucumber fisheries have not been explored systematically, especially in terms of catch change trends. Sea cucumbers are relatively sedentary species. An attempt was made to explore whether the tim...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

Data models for an integrated thesaurus database

نویسنده

چکیده

منابع مشابه

بررسی مقایسه‎ای روابط معنایی، ساختار شکلی و سیستم مدیریت اصطلاحنامه‎های فنی ـ مهندسی و نما

وضعیت بازیابی اطلاعات در دو پایگاه نمایه و نما و سنجش اثربخشی استفاده از واژگان کنترل ‌شده در نمایه‌سازی این دو پایگاه

Creating and Querying an Integrated Ontology for Molecular and Phenotypic Cereals Data

Thesaurus-Based Software Environments

Prediction of global sea cucumber capture production based on the exponential smoothing and ARIMA models

عنوان ژورنال:

اشتراک گذاری